Bayesian correlated clustering to integrate multiple datasets
نویسندگان
چکیده
منابع مشابه
Bayesian correlated clustering to integrate multiple datasets
MOTIVATION The integration of multiple datasets remains a key challenge in systems biology and genomic medicine. Modern high-throughput technologies generate a broad array of different data types, providing distinct-but often complementary-information. We present a Bayesian method for the unsupervised integrative modelling of multiple datasets, which we refer to as MDI (Multiple Dataset Integra...
متن کاملCorrespondence Clustering: An Approach to Cluster Multiple Related Spatial Datasets
Domain experts are frequently interested to analyze multiple related spatial datasets. This capability is important for change analysis and contrast mining. In this paper, a novel clustering approach called correspondence clustering is introduced that clusters two or more spatial datasets by maximizing cluster interestingness and correspondence between clusters derived from different datasets. ...
متن کاملAutomatic Scale Selection for Clustering of Correlated High-dimensional Datasets
Clustering algorithms usually have one or more parameters that control the scale at which the algorithm looks at the data. We study the problem of simultaneously selecting parameter values for multiple datasets (clustering instances), some of which are a priori known to have similar values. We propose two optimization problems related to this task. We show that one of them is NP-hard, and give ...
متن کاملA Robust Desirability-based Approach to Optimizing Multiple Correlated Responses
There are many real problems in which multiple responses should be optimized simultaneously by setting of process variables. One of the common approaches for optimization of multi-response problems is desirability function. In most real cases, there is a correlation structure between responses so ignoring the correlation may lead to mistake results. Hence, in this paper a robust approach based ...
متن کاملKpax3: Bayesian bi-clustering of large sequence datasets.
Motivation Estimation of the hidden population structure is an important step in many genetic studies. Often the aim is also to identify which sequence locations are the most discriminative between groups of samples for a given data partition. Automated discovery of interesting patterns that are present in the data can help to generate new biological hypotheses. Results We introduce Kpax3, a ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Bioinformatics
سال: 2012
ISSN: 1460-2059,1367-4803
DOI: 10.1093/bioinformatics/bts595